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Abstract 



We consider rank modulation codes for flash memories that allow for handling arbitrary charge drop 
errors. Unlike classical rank modulation codes used for correcting errors that manifest themselves as 
swaps of two adjacently ranked elements, the proposed translocation rank codes account for more general 
C/3 , forms of errors that arise in storage systems. Translocations represent a natural extension of the notion 

O ■ of adjacent transpositions and as such may be analyzed using related concepts in combinatorics and 

rank modulation coding. Our results include tight bounds on the capacity of translocation rank codes, 
construction techniques for asymptotically good codes, as well as simple decoding methods for one class 
^ ■ of structured codes. As part of our exposition, we also highlight the close connections between the new 

^SJ , code family and permutations with short common subsequences, deletion and insertion error-correcting 

CO ■ codes for permutations and permutation arrays. 

p 

^ ■ 1 Introduction 
(N 

Permutation codes and permutation arrays are collections of suitably chosen codewords from the symmetric 
group, used in applications as varied as single user communication over Gaussian channels |29[|20| . reduction 
, of impulsive noise over power-lines [H 12] , rank aggregation [5S1 [23] and coding for storage [5] . Many 

5_j ■ instances of permutation-based codes were studied in the coding theory literature, with special emphasis on 

, permutation arrays under the Hamming distance and rank modulation codes under the Kendall r distance |22l 

[TT1[5][T21 Chapter 6B]. The distances used for code construction in storage devices have mostly focused around 
two families of combinatorial measures: adjacent and arbitrary transpositions and measures obtained via 
embeddings into the Hamming space [5]|4]. This is due to the fact that such distance measures capture the 
displacement of symbols in retrieved messages that arise in modern storage systems. 

One of the most prominent emerging applications of permutation codes in storage is rank modulation. 
Rank modulation is an encoding scheme for flash memories that may improve the lifespan, storage efficiency 
and reliability of future generations of these storage devices |18l [SJ [311 [IH] ■ The idea behind the modulation 
scheme is that information should be stored in the form of rankings of the cells charges, rather than in terms 
of the absolute values of the charges. This simple conceptual coding framework may reduce the problem of 
cell block erasures as well as potential cell over-injection issues |18[ I34| . In their original formulation, rank- 
modulation codes represent a family of codes capable of handling errors of the form of adjacent transpositions. 
Such transposition errors represent the most likely errors in a system where the cells have nearly-uniform 
leakage rates. Leakage rates depend on the charge of the cells, the position of the cells but also on a number 
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of external factors, the influence of which may not be adequately captured by adjacent transposition errors. 
For example, sudden and steep drops in the charge of one group of cells cannot be modeled in this framework. 

In what follows, we present a novel approach to rank modulation coding which allows for correcting a 
more varied class of errors when compared to classical schemes. The focal point of the study that follows is 
the notion of a translocation error^ concept that generalizes the notion of a transposition in a permutation. 
Roughly speaking, a translocatioro moves the ranking of one particular element in the permutation below 
the ranking of a certain number of closest-ranked elements. As such, translocations are suitable for modeling 
errors that arise in flash memory systems were sudden drops of cell charges or high leakage levels for subsets of 
cells are expected or possible. A translocation may be viewed as an extension of an adjacent transposition to 
a transposition of elements a certain number of locations apart. In addition, translocation correspond to pairs 
of deletions and insertions of an element in the permutation. As a consequence, the study of translocations is 
also closely related to a number of problems in combinatorial theory such as the longest common subsequence 
problem and permutation coding under the Levenshtein and Hamming metric ^351 [T71 [23] • 

The rank modulation problem is by now fairly well understood from the perspective of code construction. 
The capacity of rank- modulated channel was derived in [SJ [5^1 [I] , while some practical code constructions 
were proposed in [SJ[THj, and further generalized in [5S], [TS] and [33]. Here, we complement the described 
work in terms of deriving upper and lower bounds on the capacity of translocation rank modulation codes, 
and in terms of presenting constructive, asymptotically good coding schemes. Our constructions are based 
on a novel idea of permutation interleaving, and are of independent interest in combinatorics and algebra. 
Furthermore, we propose decoding algorithms for translocation codes based on decoders for codes in the 
Hamming metric [301 133] ■ As part of our analysis, we describe the close relationships between permutation 
codes and arrays under a number of performance metrics. 

The paper is organized as follows. In Section [5] we provide the motivation for studying translocations 
as well as basic definitions used in our analysis. The properties of permutations under translocations are 
studied in the same section while bounds on the size of the codes are presented in |3l Code constructions are 
presented in Sections |4] and Section [Sj while concluding results are presented in Section [6l 



2 Basic definitions 

Throughout the paper, we use the following notation. The set of nonnegative integers is denoted by ZJ, 
while [n] := {1,2, •••,«} stands for the set of non-negative integers not exceeding n. 

A permutation is a bijection a : [n] ->■ [n], that is, for any «,j e [n], i + j, we have a{i) + cr(j'). For 
any tr € S„, we write a = (CT(l),cr(2),-",i7(n)), where a{i) is the image of i e [n] under the permutation 
a. Alternatively, we may write a as [(7(l),(T(2),---,CT(n)] or as [(T(l)cr(2)---t7(n)] if doing so does not cause 
ambiguity. The identity permutation (l,2,---,ri) is denoted by e, while a^^ stands for the inverse of the 
permutation cr. 

Permutations are denoted by Greek lowercase letters, while integers and integer vectors are denoted by 
Latin lower case symbols. 

We let S„ denote the set of all permutations of the set [n], i.e., the symmetric group of order n. Addi- 
tionally, let §(P) ,P £ [n], be defined as the set of all permutations of elements of P. 

A permutation code of length n and minimum distance d in a metric d is subset C of §„ such that for all 
distinct tt, cr e C, we have d(7r, cr) > d. 

^Note that our definition of the term translocation differs from a definition of this term commonly used in biology. See, e.g., 
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[ 6 3"8 1 5 9 2 4 7 ] >► [683159247] 

Figure 1: Rank modulation codes and adjacent transposition errors. 

Definition 1. We say that a permutation t e §„ is a transposition if for i,j e [n], 

r = {l,---,i-l,j,i + l,---,j,i,j + 
If J = i + 1 , then r is called an adjacent transposition. 

Assume that 1 < i,j < n. A translocation (j){i,j) is a permutation defined as follows. If z < j, we have 

Hhj) = (lr--,i-l,i + l,« + 2,---,j,i,i + l, ••■,"-) 

and if i > J , we have 

HiJ) = (lr--,J-l,«,j,J + lr--,«-l,« + l,---,»^) • 

For i < j, the permutation (t>{i,j) is called a right translocation and the permutation 4>{j,i) is called a left 
translocation. 

Example 2. Assuming i < j, for a = (cr(l),---,(T(n)), we have 

o-T = (CT(l),---,cr(i- l),a{j),(T{i + l),---,a{j - l),a{i),a{j + l),---,cr{n)). 

and 

'^0(*,i) = Kl),'^(2),-,a(i-l),a(z+l),<7(z + 2),-,a(i),a(z),aO- + l),-,a(n)) . 
cr(f,{j,i) = {cr{l),a{2),-,a{i-l),cr{j),cr{i),a{i + l),-,cr{j -l),a{j +l),-,a{n)) . 
In particular, let a= (1,3,5,7,2,4,6,8). For this case, 

0-0(3,6) = (1,3,7,2,4,5,6,8) and cr(?!)(5, 2) = (1, 2, 3, 5, 7, 4, 6, 8) . 

□ 

Simply put, the product of the permutation a and a translocation 4'{i,j) is the permutation obtained by 
moving a{i) to the j-th position and shifting all the elements between the i-th and j-th positions. Observe 
that the inverse of the left translocation (j){i,j) is the right translocation (j){j,i), and vice versa. 

Our interest in translocations in permutations is motivated by rank modulation coding, as illustrated 
by the examples depicted in Figures [1] and [2j In classical multi-level flash memories, each cell used for 
storing information is subjected to errors. As a result, classical error control schemes of non-zero rate can 
not be efficiently used in such systems. One solution to the problem is to encode the information in terms of 
rankings [22], rather than absolute values of the stored information sequences. As a result, data is represented 
by permutations and errors manifest themselves via reordering of the ordered elements. The simplest model 
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[ 6 3^JL5924 7 ] >► [681592437] 

Figure 2: Rank modulation codes and translocation errors caused by "large" drops of charge levels. 



assumes that only adjacent ranks may be exchanged - or equivalently, that only adjacent positions in the 
inverse permutation may be exchanged. 

This model has the drawback that it does not account for more general changes in ranks. With respect to 
this observation, consider the charge-drop model in Figure[21 Here, cell number 3, ranked second, experienced 
a leakage rate sufficiently high to move the cells ranking to the seventh position. If other cells had significantly 
more moderate leakage rates, the resulting ranking would be the result of a (right) translocation error. It is 
easy to note that in this framework, one translocation of "length" \i - j\ corresponds to \i - j\ adjacent 

transpositions. Nevertheless, a translocation should be counted as a single error, and not a sequence of 
adjacent transposition errors. 

The translocation error model may appear to be too broad to describe the phenomena arising in flash 
memories, as error models corresponding to translocations of small length arise more frequently than translo- 
cations of long length. The idea of bounded length translocations will be pursuit further in a companion 
paper. We only remark that translocation errors of arbitrary length are of interest since they are suitable 
for accounting for arbitrary charge drops of any cell independently of the drops of other cells. This makes 
them a good candidate for studying new error-control schemes in flash memories. 

We formalize the notion of translocation distance below. 

Definition 3. Let cr, tt e §„. The translocation distance between a and tt is a function 

do (cr, tt) : S„ X §„ ^ Zq , 
defined as the minimum number of translocations needed to transform cr to tt. 



In other words, do (cr, tt) equals the smallest m such that there exist a sequence of translocations 02, 0; 
for which tt = o'(j3i(j)2---(t>m- 

Observe that the function do (•,■) is nonnegative and symmetric. It also satisfies the triangle inequality, 
namely for any cri , (72 and a 3 in §„ , one has 

do (cTljCTs) < do (cri,CT2) +do (CT2,Cr3) . 

Therefore, it is indeed a metric over the space §„. 

Note that a translocation may correspond to either a left or a right translocation. As seen from the 
example in Figure [21 right translocations correspond to general cell leakage models. On the other hand, left 
translocation assume that the charge of a cell is increased above the level of other cells, which corresponds 
to a phenomena not frequently encountered in flash memories. We therefore also introduce the notion of the 
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right translocation distance. As will be seen from our subsequent discussion, the translocation distance is 
much easier to analyze than the right translocation distance, and represents a natural lower bound for this 
distance. 

Definition 4. A subsequence of length moia = (<7(1), . . . , cr(n)) is a sequence of the form cr(ii), . . . , a{im), 
with ii < 12 < . . . im- Let ai,a2 e §«• The longest common subsequence of cti and (T2 is a subsequence of both 
(Ti and CT2 of longest length. We denote this sequence by L (ai, CT2), and its length by I (cri, (72). 

Definition 5. Let cr e S„ and let I (a) ■■= I {a, e). The function / (a) is termed the length of the longest 
increasing subsequence of a. 

Definition 6. Let Rq denote the set of nonnegative real numbers. A metric d : §„ x S„ ^ Mq is right- 
invariant if, for all 7r,tT, cj e S„, we have d(7r,cr) = d(7rcLi, crcj). Similarly, d is left-invariant if d(7r,CT) = 
d(w7r, wtt). 

Lituitively, a right-invariant metric is invariant with respect to reordering of elements and a left-invariant 
metric is invariant with respect to relabeling of elements. Furthermore, if d(7r,CT) is a right-invariant metric, 
then it is applied to 7r(j) and a{i) and if it is a left-invariant metric, it is applied to TT^^{i) and (T^^{i). It 
can be easily verified that the translocation distance is a left-invariant metric. The length of the longest 
common subsequence is also easily seen to be left-invariant. 

Proposition 7. For 7r,a e S„, the distance do (tt, cr) equals n - I (tt, cr) . 

Proof. By the left-invariance of do and I, we may assume that one of the permutations is the identity 
permutation e since otherwise, instead of do (tt, cr) = n-l (tt, cr) , we can show that do (^a~^Tr, e) = n-l (cr~^7r) . 
It thus suffices to prove that do {<T,e) = n-l (ct). 

Let S( denote the set of elements in the longest increasing subsequence of the permutation cr. Clearly, 
it is possible to transform a to e with at most n - /(cr) translocations. This can be achieved by applying 
translocation that move one element from the set to its position in the identity permutation e. Hence, 

do {cr, e) < n - I (cr). 

Next, we show that do (cr, e) > n - I (cr). We start with cr and transform it to e by applying a sequence 
of translocation. Every translocation increases the length of the longest increasing subsequence by at most 
one. Hence, we need at least n - I (cr) translocation to transform cr into e and thus, do (cr, e) > n - I (cr). □ 

Since the translocation distance between two permutations is a function of the length of their longest 
common subsequence, there exists an interesting connection between the translocation distance and Lev- 
enshtein's edit distance for permutations [24]. The latter distance is used for measuring the influence of 
deletions and insertions on the "similarity" of two permutations. Formally, it may be defined as follows. 

Let the number of deletions and insertions required to transform u into v be denoted by p(u,v). Leven- 
shtein [24] showed that, for sequences of length n over [n], p{u, v) = 2{n- l{u, v)) . This equality also holds 
for permutations over §„ and thus 

P(cri,cr2) = 2do (cri,cr2) . 

This result may be also deduced directly, by observing that a translocation (t>{i,j) in a permutation a deletes 
the element cr^ and inserts the same element in location j. 

It is also of interest to see how the translocation distance compares to the Kendall's r distance used for 
classical rank modulation coding. The Kendall's t distance d7-(cr, tt) from cr e §„ to tt e §„ is defined as 
the minimum number of adjacent transpositions required to change cr into tt. A distance measure related 
to the Kendall's r distance is the transposition distance, dT(7r,CT). The transposition distance between 
two permutations tt and cr equals the smallest number of transpositions (not necessarily adjacent) needed 
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to transform tt into a. The transpositfon distance (JtCtt, a), as shown by Cayley in the 1860s, equals the 
number of cycles in the permutation cttt"^. 

Since a translocation of length t requires at most I - 1 adjacent transpositions, and since an adjacent 
transposition is a translocation, it is easy to see that 

-d^(cr,7r) < do (a-,7r) < dT-(0-,7r). 
n-1 

Both the upper and lower bound are tight: the upper bound is achieved for tt obtained from a via a single 

adjacent transposition, while the lower bound is achieved for, say, a = e and tt = (2,3, . . . ,n, 1). It is also 
straightforward to show that the diameter of the translocation distance equals n-1. Observe that the 
above inequalities imply that the translocation distance is not within a constant factor from the Kendall's 
T distance, so that code constructions and bounds specifically designed for the latter distance measure are 
not tight and sufficiently efficient with respect to the translocation distance. 

A similar pair of bounds may be shown to hold for the translocation distance and the Hamming distance 
between two permutations. The Hamming distance between two permutations cr,7r, di/(cr, tt) is defined as 
n- |F(cr,7r)|, where F(a,Tr) denotes the set of fixed points in a under the ordering of tt (and vice versa). 

The subsequence of <t consisting of elements of F((T, tt) is also a subsequence of n and thus do(iT, tt) = 
n - 1{(7,tt) < n - |F((T, 7r)| = dif((j, tt). Furthermore, since for any two permutations n,a e S„ one has 
di/(7r,a) < n, it follows that di/(7r,a) < ndo(7r,c7). Thus, 

-dH(7r,CT) < do (7r,(T) < djf (tTjct) . (1) 
n 

These inequalities are sharp. For the upper-bound, consider tt = [12-"n] and a = [Ti- -21], with n odd. For 
the lower-bound, let tt = [12"-n] and a = [23-"nl] so that dij(7r,cr) = n and do(7r, cr) = 1. 

Next, we consider the Cayley metric. Note that each transposition may be viewed as two translocations, 
implying do(7r,(7) < 2dT(cr,7r). It is also immediate that ^^dT(7r, cr) < do(7r, cr). Hence, we have 

-dT(7r,a) < do(7r,0-) < 2dT(CT,7r). 
n-1 

The relationship between the Hamming distance and Cayley's distance can be explained as follows. When 
transforming tt to cr using transpositions, each transposition decreases the Hamming distance between the 
two permutations by at most two. Hence, dT(7i', cr) > di/(7r,cr)/2. Sorting a permutation of length d requires 
at most d-1 transpositions. Thus, driT^, cr) < dH(7i', cr) - 1. These inequalities result in 

idff(7r,Cr)<dT(7r,Cr)<dff(7r,Cr)-l. (2) 

There exist many embedding methods for permutations, allowing one set of permutations with desirable 
properties according to a given distance to be mapped into another set of permutations with good properties 
in another metric space [3]. In subsequent section, we exhibit a method for embedding permutations with 
good Hamming distance into the space of permutations with large minimum translocation distance. 

2.1 Asymmetric translocation distance 

We describe next how to specialize the translocation distance for the case that only right-translocations are 
allowed to occur. 
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Definition 8. Let tt, cr e S„ and denote by -R* (tt, a) the minimum number of right translocation required 
to transform tt into a. For two permutations 771,772 e §„ , the right translocation distance do (tti , 772 ) is defined 
as 

do (771,772) = 2min{max{iit(77i,cr),i?t(772,CT)}} . 

cr 

The definition implies that for two permutations 771 e S„ and 772 e S„, one has do (771, 772) < 2t if and only 
if there exists a permutation a € such that -Rt(77i,cr) < t and Rt{n2,(T) < t. Hence, a code C is t-right 
translocation correcting if and only if do (771,772) > 2t for all 771,772 e C, 771 * 772. This means that under 
the given distance constraint, it is not possible to confuse the actual codeword 771 with another (wrong) 
codeword 772. 

Observe that the following bound holds: 

do (771,772) < do (771,772) . 

It is straightforward to characterize the minimum number of right translocations needed to transform one 
permutation to another, as we show below. 

Definition 9. Let 77, a e S„. We denote the set of 77" ^-increasing and cr"^-decreasing elements as 

J (77, a) := [i e [n] : 3j s.t. 77"^ (i) < (j) and (i) > (j)] . 
Lemma 10. iei77,0-eS„. Then 

iit (77,(7) = |J(77,a)| . 

Proof. It can be easily verified that Rt is left-invariant. So it suffices to show that 

i?t(77,e) = I J (77,e)| , 

where 

|J(77,e)| = \{i € [n] •■ 3j < i.-k'^ (i) < 77""^(j)}| . 

Let 77i be obtained from 77 by applying a right translocation. Then, it is straightforward to see that | J(77i , e)| > 
I J(77, e)| - L Thus, i?t(77, e) > | J (77, e)| - | J (e, e)| = | J (77, e)|. 

Conversely, to transform 77 into e, it suffices to apply the shortest right translocation to each i e J that 
moves i to position i' such that to the left of positions i' there are only elements smaller than i. Hence, 
i?t(77,e)<|J(77,e)|. □ 

For permutations 77,cr e §„, the difference between Rt(Tr,a) and do(77,<7) may be as large as n - 2. This 
may be seen by letting 77 = [23... nl] and a = e, and observing that Rt{TT,(7) = n -1 and do(77,cr) = 1. 
Furthermore, it can be shown that this is the largest possible gap. To see this, first, note that Rt{n,a) = 
if and only if do(77, a) = and thus to obtain a positive gap one must have do(77, a) > 1. We also have 
do(77, cr) < Ri{Tr,a) < n-1. Hence, 1 < do(77, u) < Rt{-K,(j) < n - 1, which implies that the gap is at most 
n-l-l = n-2. 

In the sections to follow, we focus our attention on the symmetric translocation distance. For simplicity, 
we omit the word symmetric. 
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3 Bounds on the size of the code 



3.1 Codes in translocation distance 



Let Aa (n, d) be the maximum size of a code of length n and minimum translocation distance d. Let 
Co(n) := denote the rate of the code. 

^ ' Inn! 

Proposition 11. For all integers n and d with n> d> I, 

{n-d+iy. 



A„{n,d) > 



Proof. Let Ba{r) be the number of permutations with translocation distance at most r from a given permu- 
tation. By symmetry, we have Bo{r) = {a : do (ti'ic) < r}. The permutations that are within distance r of tt 
are precisely the permutations a with I (7r,cr) > n - r. There are (") ways to choose the first n - r elements 
of the longest common subsequence of tt and a and at most ^J^^^, ways to arrange the remaining elements. 
Hence, 



(n - r)l 

From the Gilbert- Varshamov bound, we have Ao (n, d) > g (d-i) ^^^"^ thxLS 

n' 

Ao{n,d) > ^, 

\d-l) (n-d+1)! 

which completes the proof. □ 
Proposition 12. For all n,d eZ with n> d> 1, 

Ao {n,d) < {n-d+ 1)1 . 

Proof. We provide two proofs for this bound. The first proof is based on a projection argument first described 
in [Ij, while the second is based on a standard counting arguments associated with the proof of Plotkin's 
boun41. 

For cr 6 §„ and P £ [n], the projection ap of a onto P is obtained from a by only keeping elements of P 
and removing the other elements. For example, let a = [54321], and P = {2,4,5}. We have ap = [542]. Note 
that ap has length \P\. 

1 ) Let C be a code of length n, size M, and minimum distance d. Let k be the smallest integer such that 
'^{i,...,k+i} '''{i,...,/c+i} for 8-11 distinct a^i: e C. Hence, M < (k + 1)!. By definition, there exist a,?: 6 C such 
that a^i j^y = 77^1 f^y. So, / (cri, (T2) ^ k and thus d < do (cti, CT2) < n - k. Hence, M < (n-d+l)!. 



2) Again, let C be a code of length n, size A/, and minimum distance d. Since the minimum distance 

n! 
(d-l)! 



is cZ, all subsequences of length n - d + I of codewords of C are unique. There are jj^nj possible 



subsequences of length n - d + 1. Hence, 

m( ^ ) 



(d-iy. 

which implies that M < (n - d + 1) !. □ 



The authors gratefully acknowledge a number of discussions with Jalal Etesami. 
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From the two previous propositions, we obtain 

(n-d+iy. 



<Ao{n,d)<{n-d+iy. (3) 



All limits that appear in the rest of the paper are in n unless otherwise stated. The results below are 
stated for code families with length n, M(n) codewords, and minimum translocation distance d(n). The 
ratio <i(n)ln is denoted by 6(n). Furthermore, \im 5(n) and limCo(n) are denoted by 6 and Co respectively. 

Lemma 13. We have that 



,^Hn-din)V.__^_^^ 
Inn! 



3. 



Proof. 



In- 



Inn! 



lini_l^=0. 
Inn! 



ln(n-d(n))! _ (n - d(n)) In (n - d{n)) + O (n) 
Inn! nlnn + 0(n) 

nlnn - d{n) Inn + (n - d{n)) ln(l - d{n)/n) + O (n) 



n 



nlnn + O (n) 



d(ny. _ Inn! -lnrf(n)! 
Inn! Inn! 

d{n) \nd + 0{n) 



1- 



^ din)lnd{n) ^ ^ 

nlnn 



nlnn + O (n) 

(-) 

\ Inn/ 

(ii^) 



^ (i(n) Inn + d(n) In ((i(n)/n) 
nlnn 

1-^.0 



3. From parts 1 and 2, it follows that 



Inn! Inn! 



(inn) 



□ 
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The following theorem provides expressions for the asymptotic capacity of the translocation channel. 

Theorem 14. Let C he a family of translocation correcting codes of length n, distance d{n) . Then 
Co = 1-6. 

Proof. From ([3]), we have 

ln(n-d+l)!-ln(;_\) ln(n-d+l)! 

] i <Co(n)< j (4) 

mnl Inn! 

Taking the limit of Q by using Lemma [T3] yields 

Co = limCo(n) = l-lim(5(n) = l-(5. (5) 

□ 

At this point, it is worth observing that the problem of bounding the longest common subsequence in 
permutations has been recently studied in a combinatorial framework in [2]. In this framework, the question 
of interest is to determine the minimum length of the longest common subsequence between any two distinct 
permutations in a set of k permutations of length n. When translated into the terminology of translocation 
coding, the problem reduces to finding dk (n), the largest possible minimum translocation distance of a set 
of k permutations of S„ . 

The bounds derived in [2] are constructive, but they hold only in the zero-capacity domain of parameters. 
A more detailed description of this construction is described in the next section. The bounds of fl\ imply 
that dk (n) > n - 32 (nk)^^^ for 3 < fc < s/n. Hence, for n - 32\/n < d < n - 32 (3n)^^'^, 

Furthermore, for k> A, dk (n) > n - ^n^^^''^^^'\ ^ . For fc > 2 (1 + log2 n), this bound is of no practical use. 

For 1 + log2 n < fc < 2 (1 + log2 n), one has dk (n) > n - 2*^^^"^ which implies that, 

A„{n,d) >2(l + log2(n-d)) 

for d <n - \/n/2. Similar bounds can be obtained for Ao{n,d) by assuming that m -1 < n'^ < m for some 
integer m < [n^'''^]. Note that although these results hold for the zero-capacity regime, they still may be 
useful for finite codelength. 

Remark: Similar bounds may be derived for asymmetric translocation error-correcting codes. For this 
purpose, let B'{r) = \{a : Rt{e, a) < r}\ and B{r) = \{a : do (e, a) < r}\. Then 



— ^ ^ A (n, 2t) < — — 

Bo{2t) B{2t) ^ ' B'{t) 



where A(n,d) denotes the maximum size of a permutation code with minimum right-translocation distance 
d. 



3.2 Relationships among codes with translocation, substitution, deletion/insertion 
and transposition errors 

Translocation errors, and consequently, translocation error correcting codes are difficult to analyze directly. 
As already pointed out, the translocation distance is related to various other metrics well-studied in the 
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coding theory and mathematics literature. Since the constructions in subsequent sections rely on code 
constructions for other distance metrics, we provide a brief overview of the state of the art results pertaining 
to the Hamming, transposition and Kendall r metric. We also supplement the known findings with a number 
of new comparative results for the metrics of interest. 



3.2.1 Hamming metric 

Codes in the Hamming metric have a long history, dating back to the work [13]. The Hamming metric is 
a suitable distance measure for use in power line communication systems, database management and other 
applications. 

Let Ah (n, d) denote the largest number of permutations of length n and Hamming distance d. Frankl 
and Deza [131 Theorem 4] and Deza [lOJ showed that 

< Ah {n,d) < 



Bnid-l) - ' ' - {d-l)\ 

where Bh {t) is the volume of the sphere of radius r in the space of permutations with Hamming metric. 

Let Di denote the number of derrangments of i objects, i.e., the number of permutation of i objects at 
Hamming distance i from the identity permutation. It can be shown that Bh (r) = 1 + 'Z\=2 (")^*- Hence, 

d-l /„\ d-1 



i3H(d-l) = l+E A<E 

i=2 \ * ' i=l 



{n-i)l 

'^'-'^(^^^dTTy. 

where the first inequality follows from the fact that Di < i\. Note that although a more precise asymptotic 
characterization for the number of derrangements is known, namely 

lim — = - 



1 1 



the simple bound Di < i\ provides sufficiently tight bound for the capacity computation. 
The above results show that 

(n-d+l)l . , nl 

< Ah (n,d) < 

d-l ^ ' ' (d-l)! 

Lemma [T3] implies the following theorem. As before, the result holds for a family of codes C under the 
Hamming distance metric, with parameters n,d and S{n). 

Theorem 15. If S{n) = d{n)l^ denotes the normalized Hamming distance, then the capacity of the Ham- 
ming permutation channel equals Ch = 1 - lim(5(n). 



3.2.2 Transposition metric 

Let At {n, d) denote the maximum size of a code with minimum transposition distance at least d. From 
0, we have 

Ah {n, 2d) < At {n, d) < Ah {n, d-l) . 
The upper-bound and a sharper lower bound may also be derived from the following proposition. 
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Proposition 16. For all n and d, 



J}} 

< At {n,d) < — 



where s (n, k) is the Stirling number of the first kind. 

Proof. Let C be the largest possible code of length n with minimum transposition distance d, and with M 
codewords. Furthermore, let 7,„ : ^ §m be a mapping such that 7™ (a) = (cr (1) , cr (2) , . . . , cr (m)). Let k 
be the smallest integer such that jk+i is injective when applied to C. Then, we have M < n (n - 1) ■■■ (n - k) = 
(w-fc-i)! • Si^ce, 7fe is not injective, there exist permutations ai,a2 e C such that jk (ci) = jk {'^2)- Hence, 
d < (di, (T2) < n - fc - 1 and thus M < ^. 

To prove the lower bound, note that 

where Bx (r) is the size the ball {tt : (tt, e) < r} . It is well-known that (tt, e) = n - c (tt) , where c (tt) 
denotes the number of cycles of tt, and that the number of permutations with k cycles equals (-1)" s {n, k) . 
Hence, {n : dr (tt, e) < d - 1} = {tt : c(7r) >n-d+l} and consequently Bt (d-l) = T,i=n-d+i * * (^j 0- 

□ 

Using the bounds above, we have the following theorem. 
Theorem 17. The capacity Ct of the transposition channel is bounded as 

1 -21im(5(n) <Ct< 1 -lim(5(n). 

3.2.3 Kendall's t metric 

Let AK{n,d) denote the largest cardinality of a permutation code of length n with minimum Kendal's tau 
distance d, and let Ck = \im^'^ {n,d)li^j^i^ Barg and Mazumdar [1] Theorem 3.1] showed that 

CK = l-e, for d=Q{n^^'). 

Note that for the Kendall r metric, the maximum distance between two permutations may be as large as 
0{n^). On the other hand, the diameter of the translocation distance is 0{n). 

3.2.4 Levenshtein metric 

The bounds on the size of deletion/insertion correcting codes over permutations were first derived by Leven- 
shtein in his landmark paper |24J. The lower bound relies on the use of Steiner triple systems and designs [24j. 
More precisely, let D{n, q) be the largest cardinality of a set of n-subsets of the set {0, 1, . . . , q - 1} with the 
property that every (n - l)-subset of {0, 1, . . . , g - 1} is a subset of at most one of the n-subsets. Then 
the following results holds for the cardinality AL{n,q) of the largest single-deletion correcting codes over 
permutations in Sg j24j : 

{n-iy.D{n,q) <ALin,q) < 



n (q - n + ly. 

For a more thorough discussion regarding perfect permutation codes in the Levenshtein metric over §g, 
capable of correcting n-t errors and ordered Steiner systems S{t,q,q), the interested reader is referred 
to t24j. 
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4 Single error correcting codes for translocations and right-translocations 



This section contains constructions for single-translocation error detecting and single-translocation error 
correcting codes. For the latter case, we exhibit two constructions, one for symmetric translocations and 
another for right-translocations only. The constructions are based on codes in the Levenshtein and Hamming 
metric. 

We start by describing how to construct single-translocation detecting codes based on single-deletion 
correcting codes for permutations. 

From the discussion in Section [51 recall that the translocation distance is half of the deletion distance 
and thus any single-deletion correcting code may be used for detecting a single translocation. An elegant 
construction for single-deletion correcting codes for permutation was described by Levenstein in j24j. The 
resulting code has cardinality (n- 1)! and is optimal, since, from Proposition I12( we have 

A (n,2)<(n-2 + l)! = (n-l)!. 

Hence, A„ (n,2) = (n- 1)!. 

Levenstein's construction is of the following form. 
For a € {0, 1, n}, let 

= lue {0,1}" ■.{n + l)\f^zu{i) 
I i=i 

where a \ b denotes that a is a divisor of b. For a e S„, let Z (a) = {z (1) z {n - 1)) be a vector with 

z(z) = |' 
Levenstein shows that the code 

C = {o- 6 S„ : Z (a) 6 W^- 

of size (n - 1)! is capable of correcting a single deletion. Hence, 
error as well. 

Let Sn^*-* be the set of permutations a of length n-t that can be obtained from some cj e §„ by t deletions. 
In other words, §„^*' is the set of words of length n-t from the alphabet [n], without repetitions. A code 
C £ S„ is a perfect code capable of correcting t deletions if, for every a e Sn^*-*, there exists a unique tt e C 
such that a can be obtained from tt by i deletions. 

It was shown in |24] that C in ([6]) is a perfect code. Since the size of Sj*^ equals („"J(?i-t)! and 
(") elements of Sn^'-* can be obtained by t deletions from each a e C, we have that |C| = {n-t)l. Thus a 
perfect code capable of correcting t deletions, if it exists, is a code with minumum translocation distance 
t + I. Hence, if such a code exists, it is an optimal code for translocation errors. 

Next, we describe codes capable of correcting a single translocation error and codes capable of correcting 
a single right translocation error. The sizes of these two codes are identical, although the latter family of 
codes has a 1.5 higher rate. In the construction, we make use of a single-transposition error detecting code, 
described below. 

A single-transposition error detecting code: For (Ji,(J2 e let dT{(Ji,o-2) as before denote the 
transposition distance between cti and CT2 • The parity of a transposition a is defined as the parity of dr (c, e) . 



if a{i) <a{i + l) , 
if a{i) > a{i + l) , 



i e[n-l]. 



(6) 

this code can detect a single translocation 



13 



It is well-known that applying a transposition to a permutation changes theparity of the permutation, and 
also that, for n > 2. half of the permutations in S„ are even and half are odd|f|. Hence, the code C containing 
all even permutations of is a single-transposition error detecting code of length n and cardinality §„/2. 



4.1 Right-translocation errors 

In this subsection, we present a construction for codes that correct a single right-translocation error. For 
this purpose, we first define the operation of interleaving vectors (permutations) and codes. 

Definition 18. For permutations or projections of permutations (7i,ie [fc], of lengths nii with m < < 
••■ < TOi < m + 1, the interleaved vector is defined as a = ai o a2 ° ° o'k, consisting of elements 

'^U) = '^^{\J/k]), l<J<Em, (7) 

i=l 

where i = j (mod k). For a class of k codes Ci, let 

Ci o ••■ oCfc = {fji o ■•• oCTfe ■■aieCi,ie [k]}. (8) 
For example, for vectors a and tt of length m, we have 

ao7r= (a(l),7r(l),a(2), 7r(2), -,(7(771), 7r(m)) (9) 
and for vectors a and tt of lengths m and m - 1 respectively, we have 

ao^=(a(l),4l),a(2),42),-,a(m)). (10) 

To construct a code capable of correcting a single right-translocation, we partition [n] into two sets, Pq 
and Pi, of sizes [n/2\ and [n/2], respectively. Let Ci be the set of even permutations of elements of Pi, for 
i = 0,l- 

Proposition 19. The interleaved code Cq o Ci corrects a single right-translocation error. 

Proof. Suppose n is even. The proof for odd n is similar. Furthermore, suppose that Pq and Pi consist of 
even and odd numbers in [n], respectively. Given the permutation w, we want to find the unique tt such 
that ui = TT(j){i,j), i < j. The kth element of w is out of place if fc ^ w (fc) (mod 2). It is easy to see that 

i = fcmin •= minjfc : k ^ Lu (fc) (mod 2)} , 

i.e., i equals the smallest k such that the kth element of uj is out of place. 

Finding j is slightly more complicated since we must consider two different cases. 

Let /cniax •= max {fc : k ^ lu (k) (mod 2)}. First, suppose that j = i (mod 2). One can see that, in this case, 
j = 1 + fcinax- Second, suppose that j ^ i (mod 2). Then, we have j = fcmax- Hence, j € {fcmax, kma.^ + 1}. This 
ambiguity comes from the fact that uj (fcmax) and uj (1 + fcmax) have the same parity. 

Let TTg := w0(fcmaxil) ^ud TTi := a;0(l + fcmaxj 1) ■ Furthermore, let o'^'\ for I e {0,1} and h e {0,1}, be 
the subcode of tt; consisting of elements of Pfi. In words, ttq gives rise to the two component codewords 
cTq^^ and (t['^\ where ctq*'^ consists of even numbers and af^ consists of odd numbers. Similarly, tti gives 
rise to two component codewords ctq^'* and cr^^'' with the same properties as described for ttq. It can be 

easily verified that a^^^ = ctq^'' and that a[^^ = o'|^'' (oj (fcmax) w (1 + fcmax))- That is, a[^^ and crj^'' differ 
only in the transposition of elements w (fcmax) and w(l + fcmax)- Since the underlying subcode is a single 
transposition-error detecting code, only one of the af''^ and a['^ is valid. Thus, tt is unique. □ 

^More precisely, the symmetric group can be partitioned into the alternating group and its coset. 
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The cardinaUty of the interleaved code Co° Ci equals and its rate asymptotically equals 

ln(l/4)+21n[fj! n\nn + 0{n) 



Inn! ?ilnn + 0(n) 



4.2 Translocation errors 



The following construction leads to single translocation-error correcting codes. Assume that n = 3k. We 
start by partitioning [n] into three sets Pq,Pi, and P2, each of size k. We let Q be the set of all even 
permutations of elements of Pi, for i = 0,1,2. A codeword in C is obtained by interleaving codewords ai € d, 
where elements of each codeword ai are placed in positions that are equal to i modulo three. That is, a tt 6 C 
may be written as 7r(j) = at where i = j (mod 3), or equivalently, C = Co ° Ci o C2. The cardinality 

of the code equals (5(f)!) and its rate equals 

31n(l/2)+31n(f)! nlnn + 0{n) ^ 
Inn! nlnn + 0(n) 

The construction above can be easily extended to the case where n = 3k ± 1. 

Proposition 20. The code described above can correct a single translocation error. 



Proof. Suppose that tt is the stored permutation, lo is the retrieved permutation, and that the error is 
the translocation (f>{i,i). If = 1, then (t>{i,j) can be easily identified. Suppose that \i- j\ > 1. The 

translocation 4i{i,j) moves \i - j\ elements of tt one position to the left, provided that i < j, and one position 
to the right, provided that j < i. In either case, at most one element moves in the "opposite direction''. Hence, 
for \i - j\> 1 the direction of the translocation (left or right) can be easily identified. Once the direction of 
the translocation is known, the proof follows along the same lines as that of Proposition [191 D 



The ideas described above can be extended to the case of more than one translocation error. For the case 
of right translocation errors, the generalization is more complicated and it does not readily follow from the 
arguments above. In the generalized scheme, one can construct single translocation-error correcting codes 
for n = 3s, but with a slightly smaller rate than the one presented in this section. 

Example 21. Consider the single translocation-correcting code for n = 12. For this case, we have 
Pi = {1,4, 7,10}, F2 = {2,5,8,11}, and Pq = {3,6,9,12}. 

Suppose that the stored codeword is ct = [1,11,6,10,8,3,7,5,12,4,2,9] and the retrieved word is = 
[1,11,12,6,10,8,3,7,5,4,2,9]. 

Given a, the decoder first identifies elements out of order - in this case, {6, 10, 8, 3, 7, 5}. Since more than 
two elements are out of order, we have \i-j\ > 1. Proceeding as in the previous example, we find kmin = 4 and 
kmax = 9. Note that since we are dealing with a left translocation, the roles of kj-^in and kj^ax are swapped 
and candidate translocations are (pik^ax t f^min) = 0(9,4) and (j){k„-iax,kmin ~ 1) = '/'(9,3). 

Let CTi = o-</>(4,9) = [1,11,12,10,8,3,7,5,6,4,2,9] and 02 = d(j){3,Q) = [1,11,6,10,8,3,7,5,12,4,2,9]. 
For (7i, the three interleaved component permutations are a'^^ = [1,10,7,4], crj^-* = [11,8,5,2], and af^ = 
[12,3,6,9] and for as, they are ct^^^ = [1,10,7,4], of'' = [11,8,5,2], and af^ = [6,3,12,9]. Among these, 
only af^ is an odd permutation, so the transmitted codeword equals 

a = (72 = [1,11, 6, 10, 8, 3, 7, 5, 12, 4, 2, 9]. 

□ 
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5 t-translocation Error- Correcting Codes 



We describe next a number of general constructions for t-translocation error-correcting codes. We start with 
an extension of the interleaving methods from Section |4l 

5.1 Interleaving constructions for non-zero rate codes 

In what follows, we construct a family of codes with translocation distance 2t + 1, length n = s {2t + 1) for 
some integer s > At + 1, and cardinality M = {AH{s,At + 1)) , where A//(s,d), as before, denotes the 
maximum size of a permutation code with minimum Hamming distance d. The construction relies on the 
use of 2t + 1 permutation codes, each with minimum Hamming distance at least 4t + 1. First, we present the 
proposed construction and then prove that the minimum translocation distance of the constructed code is 
at least 2t + 1 . 

For a given n and t, where n = (mod 2t + 1), partition the set [n] into 2t + 1 classes Pi, each of size s, 
with 

Pi = {j e[n]: j = i{inod2t + l)} , ie[2t + l]. (11) 
For example, for i = 2 and n = 45, one has 

Pi = {1,6,-,41} ,P2 = {2,7,-,42} ,P3 = {3,8,-,43},... . 

For i 6 [2< + l], let Ci be a permutation code over Pi with minimum Hamming distance at least At+l. The 
code C is obtained by interleaving the codes Ci, i.e., C = Ci o ■•• o C2t+i, and is referred to as an interleaved 
code with 2t + 1 classes. In the interleaved code, the s elements of Pi occupy positions that are equivalent 
to i modulo 2t + 1. 

The following theorem provides a lower-bound for the minimum translocation distance of C. The proof 
of the theorem is presented after stating the required definitions. 

Theorem 22. Consider positive integers s, t, n= s{2t + V), and a partitioning of [n] of the form of (jlip . 
//, for i 6 [2t + 1], Ci is a permutation code over Pi with minimum Hamming distance at least At + I, then 
C = Ci o ■■■ o C2t+i is a permutation code over [n\ with minimum translocation distance greater than or equal 
to 2t + l. 

Corollary 23. For the code C of Theorem \22\ and distinct a,?: e C, the length of the longest common 
subsequence of n and a is less than n - 2t. 

For convenience, we introduce an alternative notation for translocations. Let the mapping ip : Sn ^ §„ 
be defined as follows. For a permutation a e §„ and a e [n], let il}{a,l)a denote the permutation obtained 
from (7 by moving the element a exactly \l\ positions to the right if £ > and to the left \i I < Q. In other 
words, for any permutation u e S„ and a e [n], 

il^{a,l)(j = a0(cr"^(a),cr"i(a) +£). 

For example, we have V(4, 3)[34251] = [32514] = [34251](^(2, 5). Note that the mapping ip is written 
multiplicatively. Furthermore, with slight abuse of terminology, '>p{a,i) may also be called a translocation. 

Consider cr, tt e §„ with distance do(cr, tt) = m. A transformation from cr to tt is a sequence "01: V'2, "'j V'm 
of translocations such that tt = V'm"'V'2'0i<''- 
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Let &i < &2 < ••• < be the elements of [n] that are not in the longest common subsequence of a and 
TT. Each 6fc is called a displaced element. The set {&i,---,fom} is called the set of displaced elements and is 
denoted by £)((t, tt). 

The canonical transformation from ct to tt is a transformation ipm'"'4'2'>Pi with ip^ = i'ibkj^k) for appropri- 
ate choices oi ik,k e [m]. In other words, the canonical transformation operates only on displaced elements 
and corresponds to the shortest sequence of translocations that transform a to tt. 

As an example, consider a = [1, 2, 3, 4, 5, 6, 7, 15] and tt = [1,3, 4, 5, 6, 2, 7, 15]. The canonical trans- 
formation is -0(2,4) and we have 

^ = V(2,4)a=[l,3,4,5,6,2,7,-,15]. (12) 

In this example, D{a,TT) = {2}. 

Let TT; = ipi---ip2'4'i'^ for 1 < I < m. An element a is moved over an element b in step j if there exists a 
translocation in the canonical transformation -0 = '>p{a,i) such that a is on the left (right) of b in ttj-i and on 
the right (left) side of b in ttj. That is, ip moves a from one side of b to the other side. In the above example 
with ip{2,4), 2 is moved over 4 but it is not moved over 7. 

An element k e [2t + 1] is called a a^n-pivot, or simply a pivot, if no element of is displaced, i.e., 
Pk n I?(cr, vr) = 0. In the example defined by ([T^ . the pivots are 1 and 3. 

For I £ [2t + 1], define P/ as Ui^jPi. Also, recall that for w e S„ and a set P, wp denotes the 
projection of uj onto P. For example, ior t = 1, n = 9, ui = [1, 2, 3, 15], and / = {1,3}, we have 
= [1,3,4,6,7,9,10,12,13,15]. We say that up^ has correct order if for every i,j e I^i < j, elements 
of Pi and Pj appear alternatively in wp-up^ , starting with an element of P;. In the example above, up^ has 
correct order. 

Consider cj e S„ and suppose that top. = [aia2---as]. The elements of the set Pi = {ai,---,as} may be 
viewed as separating subsequences of to consisting of elements not in Pi. That is, we may write 

io = roairia2---asrs 

where the r;'s are non-intersecting subsequences of [n]\Pi. For each I, the subsequence r; is called the l-th 
segment of io with respect to Pi and is denoted by Ri {ui,Pi). Below, such a segmentation is shown for the 
permutation 

UJ = [c364ai&2&3C5a2a3Cia4656iC2a5C4] . 
Each segment is marked with a bracket: 

UJ = [ C364 ai b2b3C5 a2 as ci 04 65&1C2 05 C4 ] 



To better visualize the subsequences in question, we may replace each element of Pi by * and write uj as 

UJ = [ C364 * fo2&3C5 * * Ci * 6561C2 * C4 ] . 

We have, for example, P2 {^,Pi) = [^2^325] and P3 {uj,Pi) = []. 

Definition 24. Consider i,j e [2t + 1], Pi = {ai,---,as}, and uj e §„. Suppose, without loss of generality, 
that ujp. = [aia2---as]. The sequence is defined as follows. 

• If i = j, then a;(j|i) = a;(i|i) equals uj^iy. 

• If J > i, then, for 1 < Z < s, let ujQ^i-^ (l) = Ri (wp-up, , Pi) whenever Ri {uj(^j\i) , P;) has length one, and set 
W(j|i) (l) = e otherwise. Here, e is a special notational symbol. 
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• If j < i, then, for 1 < Z < s, let uj(j\i) (l) = Ri-i {(^PiuPjTPi) whenever has length one, 

and set (0 = e otherwise. 

As an example, iff; = {01,02,03,04,05}, Pj = {&i, &2, &3, ^4, ^5}, j < i, and wp^up^ = [61010262^30364046505], 
the segments oi io^^ jy are [61], [], [6263], [64], and [65], in the given order, and we have = [6iee6465]. 

Proof. fTheorem \2^) Suppose the minimum distance of C is less than 2t+l. Then, for two distinct codewords 
7r,cr 6 C, there exists an w e S„ such that do(7r,aj) < t and do(cr, < t. 

Since ct # tt, there exists k e [2t + 1] such that ttp^. i= ap^ , which implies that d/f (vrp^ , ap^ ) > At + 1. Since 
do(cr, < t, by Lemma B51 there exists I £[2t + 1] of size at least t + 1 such that wp^ has correct order. 

If fc € /, by Lemma [27l d//(crp^ , up^, ) <2t and d^f (ttp^ , wp^, ) < 2t, which together imply dnio'p^ , ttpj. ) < 4f. 

On the other hand, if fc ^ /, by Lemma |27[ for any i e /, d//(fTp^,u'(fc|i)) < 2t and d/f (ttp^, , w^^ii)) < 2t, 
which again imply d^CcPfc , ^rp^. ) < At. 

Hence, by contradiction, the minimum distance of C is at least 2t + l. □ 

Lemma 25. Consider the interleaved code C of Theorem \2Si and let cr e C Furthermore, let lo e §„ he 
such that do(cr, 1^) < t. There exists at least one subset I c \2t + 1] of size at least t + \ such that ujp^ has 
correct order. 

Proof. There are at most t displaced elements, and thus, at most t classes containing a displaced element. 
Hence, there exist at least 2t + l- t = t+ \ classes without any displaced elements and, consequently , at least 
t + 1 (T, cli -pivots. Let / be the set consisting of these pivots. It is clear that uj obtained in this way has 
correct order which proves the claimed result. □ 

Lemma 26. For all positive integers s and t and all permutations a,LU e §„, with n = (2t + l)s, if i* is a 
a,UJ-pivot, then for j e [2t + 1], 

clif(cr(jji*),t^(jji»)) < 2do{a;uj). 

Proof. Assume do{a,Lu) = m and let tpm'"tp2'4'i be the canonical transformation from a to cj, so that uj = 
'4'm'"tp2'4'i'^- We prove the lemma by induction on m. Clearly, if to = 0, then 

d//((To|i.),wo|i.)) =0. 

Let TT = ■ As the induction hypothesis, assume that 

dH(o-o|,»),vr(j|,.)) < 2(to- 1). 
By the triangle inequality, it suffices to show that 

d//(7r(j|i*),u;(j|i.)) < 2. (13) 

Suppose V'm = "0(^1 ^) so that u = ip„i tt. Since i* is a pivot, we have b i Pi* . We consider two cases: b i Pj 
and b e Pj. First, suppose h i Pj. Since b i Pi* u Pj, we have irp.^uPj = ^P-tuPj and thus d/f (7r(j|j»),a;(j|i»)) = 0. 

On the other hand, suppose 6 e Pj. Then, 6 appears in i?fc (ttp.^uPjj^i*) ofvrp.^up^ and in _R; (^ujp.^uPj , Pi*) 

of 

^p^*uPj 1 for some /b, /. Xhe only segments Rffected by the translocation if^m are Rk {T^p-tuPj , Pi*) and 
Ri (wp,,uPj,-Pi»): and thus, for p e [2t + l]\{l,k}, we have Rp {np^^uPj , Pi*) = Rp {oLip.,uP,,Pi*)- Hence, for 
pe [2t + l]\{l,k}, we find 7r(j|i,)(p) = implying that dH{TT{j\i*),uj(j\i*)) < 2. □ 
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Lemma 27. Consider the interleaved code C of Theorem \2'A Let a e C and uj e §„ such that do((T, a;) < t. 
If I '= [2t + 1] is of size at least t + 1 and up^ has correct order, then 

1. for each i e I , dn (cp. ,ujp.) <2t and, 

2. f oriel and j ^ I, dn (ctp^. ,a;(jjj)) < 2t. 

Proof. Since there are at most t classes containing displaced elements and / has size at least t + 1, there 
exists a pivot i* € /. Then, by Lemma [26l 

But (T(i|i») = CTp. and, since ujpj has correct order, = ^Pi- This completes the proof of part 1. 

To prove the second part, we proceed as follows. Let r(a, 6) denote the transposition of elements a and b. 
Assume '0m'"V'2'0ii with m = do(o',aj), is the canonical transformation from cr to oj so that ui = ijjm'"tp2'4'iO'. 

We first show that ?Am---V'2V'i niay be decomposed into four parts as follows 
with t' + i^') + 1^^^ = m such that 

^-^ = ^(4,4), aUP.^Pj,ke[t'], (14) 
Tk = T{ak,bk), afc 6 Pi,6fc 6 Pi,fc 6 [t^] , 

4'^ = 4ai^\ii'^y ai?^eP„ke[t(^^], 

and such that no t/;^*'' moves a^'^ over an element of Pi* . 

It can be easily verified that any two translocations "(/"(o, (i) and '4'i^^ ^2) "commute". That is, for any per- 
mutation TT, we can find translocations 'i(){a,iz) and £4) such that tp{a,£i)tlj{b,£2)T' = '4'{b,t4)'4'{a,lz)'K. 
Thus, we have the decomposition 

with t' + i(') + 1(^) = m such that 

^-^ = ^(4,4), <^P,uP„fc6[t'], 

V^^' = V«4'), 4:6P„fc6[t«], 

4^^ = ^(o^,(^l 4^)eP„fce[tO)]. 

Furthermore, it is easy to see that we may write as ^^^(}-,---ip''i^ Tt^---Ti with 

Tk = T{ak,bk), akePi,bkeP^,ke[tr], 

such that no ■f/;^,*' moves a^'^ over an element of Pi* . Hence, for any permutation uj one can write a decom- 
position of the form 

Let uj' = Tt^---Tiip[r--ipiU, and oj'-*^ = lu' , so that lu = By the triangle inequality 

dH {(Jp, , WOK) ) < (fTp^ , ) + dn , t^g,) )+dH (wg^^ , ) . 
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It is clear that dn (fPj , ~ ^- Next, consider and its transform induced by the translocation 
'ip^^\k 6 [t^*-*]. Note that uj'p^ has correct order. Since no translocation moves a^*-* over an element 
of Pi*, each ip^'^ moves a^*^ over at most one element of Pj. Thus, each ?/;[.'^ can modify at most two segments 
and we have that Ah - Si^*-*. 

Similar to the proof of Lemma [25l each -0^"'^ modifies at most two segments and thus (V'ljj'i) i '^(jji)) ^ 
2t(^\ Hence, 

d H (crp, , ifoio ) < + 2t(') + 2^^^) < 2m. 

□ 

The rate of translocation correcting codes based on interleaving may be easily estimated as follows. The 
cardinality of the interleaved code of length n and minimum distance at least d is at least {^Ah ([^J , 2(i - l)) , 

for odd d, and (Ah([^J ,2d+ l)) , for even d. where, as before, Anin^d) denotes the largest size of 
a permutation code of length n and minimum Hamming distance d. Hence, the asymptotic rate of the 
interleaved code equals 

ln|C| d(n)ln^ff(^,2d(n)) 

lim = lim 1 

Inn! Inn! 

ln^H(rf(^,2d(n))^(^) ln(n/d(n))! 



lim ■ 



ln(n/(i(n))! Inn! 

n In n - n In d + O (n) 



/ , d^(n)\ 
(l-2hm^j 



lim ■ 



n 



Inn + O (n) 



Assuming that Hamming codes with minimum distance d ^ are used for interleaving, one obtains a 
translocation error-correcting code of rate 

Inn! 

In the next section we describe a simple modification of the interleaving procedure, which, when applied 
recursively, improves upon the code rate 1-/3. 

5.2 Recursive Constructions 

The interleaving approach described in the previous subsection may be extended in a straightforward manner 
as follows. Rather than interleaving permutation codes with good Hamming distance, as in Section [5.11 one 
may construct a code in the translocation metric by interleaving a code with good translocation distance 
and a code with good Hamming distance. Furthermore, this approach may be implemented in a recursive 
manner, reminiscent of the Plotkin construction for binary codes. In what follows, we explain one such 
approach and show how it leads to improved code rates as compared to simple interleaving. 

We find the following results useful for our recursive construction method. 

Lemma 28. Let cr, tt e §„ he two permutations, such that do(cr, tt) = 1. Then, there exist at most three 
locations i, i e[n-l\, such that for some j = j{i) e [n - 1]; 

• a{i) = 7r(j); 
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• a{i + 1) i= Tr{j + 1). 



Proof. Suppose tt = a4>{ii,i2). The proof follows from the shuple fact that when applymg a translocation 
</>(«!, 12) to tr, the locations i described above equal 

ii,i2 ~ 1, and 12 if ii > 12 > 1, 

ii and 12 if ii> 12 = 1, 

ii - 1, 12 - 1, and 12 if < Z2 < 

ii - 1 and 12-1 if ii < ^2 = n. 

□ 

Corollary 29. Lef u, tt e Sn he two permutations, and assume that there exist a > different locations i, 
i e [n - 1], such that a{i) = 7r(j), but a{i + 1) + Tr{j + 1) for some j e [n - 1]. Then, do{a, tt) > [a/3] . 

For an integer p> 1, let fi= [12- --p] and let cti, (T2 e + 1, 2p - 1}). Note that 



/XOfJi = (l,cri(l),2,CTi(2),---,p- l,CTi(p- l),p) 
HO(T2 = (l,Cr2(l),2,(T2(2),---,p- 1,CT2(P- 

Theorem 30. For ^,ai, and 02 described above, i/ cI//(cti, 172) > then 

doin o (Ti,/i o (72) > \2d/3] . 



(15) 



Proof. Let tti = o ai and 7r2 = fi o a2. Below, we show that the number of indices £ in tti with respect to 
TT2 that satisfy the conditions described in Lemma [28] is at least 2d. Then, the claim of the theorem follows 
when we apply Corollary 1291 with a = 2d. 

Assume that (Ji{£) + o'2{£) for some £ e [p - 1]. For each such £, the two indices 2£-\ and 2£ can both 
serve as index i in Lemma 

1. Wehave7ri(2^-l) = 7r2(2f-l) = £, yet 

ax{£) = nx{2£) + n2{2£) = cj2{£). 

2. Let j 6 [p] be such that tti{2£) = 7r2(2j). It is easy to see that j + £. Then, 

£+1 = 7ri(2£+l) + 7r2(2j + 1) = j + 1 . 

□ 



Let fxoC = {p o a : a e C} . The following corollary follows from Theorem [30l 

Corollary 31. For integers n and p with n = 2p-l, let pL = [12- ■ -p] and suppose C ^S{{p + l,---,n}) is a 

code with minimum Hamming distance at least ^d./2. Then fioC is a code in §„ with minimum translocation 
distance at least d and with size \C\. 

Hence, for odd n, we can construct a translocation code with length n, minimum distance at least d, and 
size Ah ((""1)/2, [3''/2]) . This can be easily generalized for all n to get codes of size 



1, 



sd- 

~2 
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By assuming that the permutation code in the Hamming metric is capacity achieving, the asymptotic 
rate of the constructed code becomes 

lnAH([f1-l,[^j) ^ ^. In ([f 1 - 1, ln[t]! 



Inn! ln[f]! Inn! 

1 3 d{n) 1 3 , 
= lim = 0. 

2 2 n 2 2 

Therefore, this code construction incurs a rate loss of 1/2(1 +5) when compared to the capacity bound, which 
in this case reads as 1- S. 

The final result that we prove in order to describe a recursive interleaving procedure is related to the 
longest common subsequence of subsequences of two sequences and the minimum translocation distance of 
interleaved sequences. 

Lemma 32. For cr, tt e §„ and P E [n], we have 

do (cr, tt) > do (crp,7rp) + do (o-Q,7rQ) 

where Q = [n] \P. 

Proof. Without loss of generality, assume that a is the identity permutation. It is clear that l{Tr) < 1{ttp) + 
ling). Hence, 

do (cr, tt) = n - /(tt) 

>n-l{TTp) -1{ttq) 

= \P\-l(np) + \Q\-l{7rQ) 

= do {<Tp,np) + do {(JQ,nQ) ■ 

□ 

Lemma 33. For sets P and Q of sizes p - 1 and p respectively, let Ci E §(-P) he a code with minimum 
translocation distance d and C[ c S{Q) a code with minimum Hamming distance 3d/2. The code C[ o Ci = 
{cr o TT : cr 6 C{ , TT 6 Ci} has minimum translocation distance d. 

Proof. For CTi,cr2 6 C[ and 7ri,7r2 € Ci, we show that do (cri o tti, CT2 o 7r2) > d. 
The case cri = cr2 follows from a simple use of Theorem [30l 

Assume next that ai + a^. Then by Lemma I32| do (cri o tti , cr2 o 712) > do (ai , CT2 ) ^ and this completes 
the proof. □ 

We now turn out attention to a recursive code construction based on the findings outlined above. 

Let a = 3/2. For a given n, set P = {l,---,[fl} and set Q = {[f] + 1, 2 f f ] - l}. Suppose C[ £ S(P) 
is a code with minimum translocation distance d and Ci £ §((5) a code with minimum Hamming distance 
ad. Assuming that permutation codes with this given minimum Hamming distance exist, we only need to 
provide a construction for C[ . An obvious choice for C[ is a code with only one codeword. Then, C = C[oCi 
is a code with minimum translocation distance d and cardinality 
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The gap to capacity may be significantly reduced by observing that C[ does not have to be a code of 
cardinality one, and that C[ may be constructed from shorter codes. 

To this end, let C{ = o C2 where is a code of length with minimum translocation distance d, 
while C2 is a code of length f -j] - 1 and minimum Hamming distance ad. 



By repeating the same procedure k times we obtain a code of the form 

{{{CioCk)oCk-l)o-)oCi 



(16) 



where each Ci, i < fc, is a code with minimum Hamming distance ad and length [||] - 1 and C'f. is a code with 
minimum translocation distance d and length f^r]- Since each d is a permutation code in the Hamming 
metric with minimum distance ad, we must have j"^] - 1 > ad. To ensure that this condition is satisfied, in 
(fT6| , we let k be the largest value of i satisfying ^ - 1 > ad. It is easy to see that k = [log ^^^^^ J. Furthermore, 
we choose to consist of a single codeword. 



The asymptotic rate of the recursively constructed codes equals 

k 



2 K / 

inn! \ 



1 



i=l 
k 



Inatl-l)! 
lim|:(l-^)2- 

im|:(2--^^^) 

" ) 



Inn! 



(17) 



lim|l-2 



ad{n)k 



log^ 
ad 



where the last step follows from limfc = [log ^J. Note that this rate is roughly equal to 1 - ch5 (l + log ^). 



5.3 Permutation Codes in the Hamming Metric 

In the previous subsection, we demonstrated a number of constructions for translocation error-correcting 
codes based on permutation codes in the Hamming metric. There exists a number of constructions for sets 
of permutations with good Hamming distance. For example, in |171 1161 114| [7] constructions of permutations 
in §n using classical binary codes were presented, while other constructions rely on direct combinatorial 
constructions [3, 8J. 

In the former case, if C is a binary [n, an, /3n] code, the construction applied to C yields a subset of 
§„ of cardinality an, with minimum Hamming distance f3n. This construction and constructions related 
to it may be used in the interleaved permutation code design, resulting in sets of permutations in S„ of 
cardinality exp0(n) and minimum Hamming distance Q(n). These permutations may consequently be used 
to construct permutation codes in §2n with exp0(n) codewords and minimum translocation distance Q(n). 

Ill what follows, we describe a simple method for constructing sets of vectors of length m > over [n] 
such that all entries of the vector are different, and such that the minimum Hamming distance between the 
vectors is large. In other words, we propose a novel construction for partial permutation codes under the 
Hamming metric, suitable for use in the recursive code construction described in the previous subsection. 
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The idea behind the proof is based on mapping suitably modified binary codewords in the Hamming 
space into partial permutations. For this purpose, let C be a binary [N, K, D] code, and for simplicity of 
exposition, assume that n is a power of two. Let C. We construct a vector x = x(c) e ([n])'", where x 
is a mapping described below, as follows: 



1. Divide c into m binary blocks Ci,C2,---,Cm of lengths log2n-log2m each. Again, for simplicity, we 
assume that m is a power of two. 

2. For each block Ci, i e [rn], construct a vector Xi of length \0g2n according to the rule: the first 
log2 n - log2 m bits in Xi equal Cj, while the last log2 m bits in Xi represent the binary encoding of the 
index i. Note that the integer values represented by the binary vectors Xi,X2, ■■%x,„ are all different. 

3. Form an integer valued vector x = x(c) of length m over [n], such that its i-th entry has the binary 
encoding specified by Xi- Observe that all the integer entries of such a vector are different. 



Now, take two vectors a, c € C, such that their Hamming distance satisfies djy (a, c) > D. Let x = x(a) and 
V - X(c) be the corresponding vectors of length m over [n] constructed as above. Then, there exist at least 
^/iog2 n-iog2 m blocks of length log2 n that are pairwise different. Therefore, the corresponding ^/loga n-iogj m 
entries in x and y are pairwise different as well. 

Consider the set of vectors 

S' = {x{c) : C€C} . 
It is straightforward to see that the set <S' has the following properties: 

1. For any a; e <S', all entries in x are different. 

2. For any x,y &S', x + y, the Hamming distance satisfies dHix,y) > j . 



The set S' can be used similarly as the set <S„ in the basic construction to obtain codes over Sn+m with 
translocation distance 



O 



I 5 ] 



Note that in this case, only m numbers in the range {n+ 1, n + 2, •••,n + m} are inserted between the numbers 
in [n] , while the Hamming distance of the vectors is preserved. 

Lemma 34. The parameters N, n and m are connected by the following equation: 

N = m log2 n-m log2 m = m log2 — . 

m 



From this lemma, if we take m = ^n, then N = ^n. 
where a > and /3 > are constants, we obtain a set 
corresponding code C is able to correct n translocation 



By taking a code C with parameters [in,an,/3n], 
S' of size 2"" and Hamming distance 8(n). The 
errors, and it has 2"" codewords. 



5.4 Decoding of Interleaved Codes 

An efficient decoder implementation for the general family of interleaved codes is currently not known. For 
the case of recursive codes, decoding may be accomplished with low complexity provided that the Hamming 
distance of the component permutation codes is increased from 3cy2 to 2d. 
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For simplicity of exposition, we assume n to be odd, more precisely, of the form n = In' + 1. The case of 
even n may be handled in the same manner, provided that one fixes the last symbol of all codewords. 

Let a = [1, (7 (1) , 2, (7 (2) , (7 (n') , n' + 1] e C be the stored codeword and let vr e §„ be the retrieved 
word. 

For i 6 [n'], denote by the substring of tt that starts with element i and ends with element i + 1. If 
i + 1 appears before i in tt, then is considered empty. For i € [n], let 7r(i) = u if contains a unique 
element of P2, say u. Otherwise, let 7r(i) = e. 

Lemma 35. The permutation tt differs from a in at most 2do (c, vr) positions. 

Proof. Let t = do (a, tt). There exists a sequence 0i, (j)2, (pt of translocations such that tt = a(j)i(f>2'"4't- For 
i € {0,---,t}, let TTi = fj(j)i(j)2'"4'i ^iid let Li be given as 

= {j\3k < i : TTfe (j) + dk (j)} . 

The set Li may be viewed as the set of elements displaced by one of the translocations 02, Note 
that, for each i, Li £ 

To prove the lemma, it suffices to show that \Lt\ < 2t, since {j|7r(j) + o'(j)} £ Lt. 

Let Lq = 0. We show that < + 2 for i e [t]. 

The translocation (pi either moves an element of Pi or an element of P2. First, suppose that it moves an 
element j of Pi. Then (pi can affect only the substrings s^Si and sj'"^ of iii-i. In the second case, assume 
that (pi moves an element of P2. It can then be easily verified that at most two substrings of tt^-i, namely 
sj^'"^ and sjj^"\ may be affected by the given translocation. Hence, \Li\ < + 2. □ 

Assume now that C £ §„ is an interleaved code of the form 

C = C'ioCi 

where C( = {[1, 2, n' + 1]} and Ci is a permutation code over the set P = {n' + 2, 2n' + 1} with minimum 
Hamming distance At + 1. 

Let a e C he the stored code word and tt e §„ be the retrieved word. Assume that do (cr, tt) < t. The first 
step of the decoding algorithm is to extract tt from the permutation tt. By Lemma [35l we have dn (<7, tt) < 2t. 
Since Ci has minimum Hamming distance 4i + 1, a can be uniquely recovered from tt. Thus, decoding is 
accomplished through Hamming distance decoding of permutation codes, for which a number of interesting 
algorithms are known in literature [30l [33l [27j . Note that this simple decoding method may be used on a 
recursive construction of depth larger than one, following the same rules as described above. 

We briefly comment on the rate loss introduced into the interleaved code construction in order to perform 
efficient decoding. To compute the rate of the code C, note that 

ln|C| _ ln|Ci| _ ln|Ci| Inn'! 
Inn! Inn! Inn'! Inn! 
Assume that the Hamming code Ci is capacity achieving, and that therefore it holds that 

limi^.l-41iml 
Inn'! n 

By observing that, 

Inn'! (n/2)lnn + 0(n) 1 

lim = lim — = - 

Inn! nlnn + 0(n) 2 
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Figure 3: Rates of several constructions v. distance for n = 150. 



one arrives at 
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ln|C| 
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1 
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Inn! 2 n 2' 

Note that a capacity achieving code for translocation errors has rate 1 - ^6. 

For the recursive construction described in the asymptotic rate of the efficiently decodable codes 

outlined above equals 1 - 2"!-'°^ ^-1 - ad [log ^J, with a = 2. 

Remark: Permutation codes in S„, correcting adjacent transposition errors, were thoroughly studied 
in [1]. We note that these codes can also be used to correct translocation errors. Indeed, every translocation 
can be viewed as a sequence of at most 2n-l adjacent transpositions. Therefore, any code in §„ that corrects 
f{n) adjacent transpositions (for some function f{n)) can also correct 0{f{n)/n) translocation. 

It was shown in Theorem 3.1 of Ijj that the upper bound on the rate of the code correcting 0{n^) adjacent 
transpositions is zero. Such code can also be used to correct 0{n) translocation errors. In comparison, the 
interleaving constructions described above can also correct 0{n) translocation errors, yet their rate is strictly 
larger than zero. 

The non-asymptotic and asymptotic rates of various code families in the translocation distance are 
compared with the capacity formulas in Figures [3] and |4l 

Note that the gap from capacity of the constructions presented in the paper is still fairly large, despite the 
fact that the codes are asymptotically good. This result may be attributed to the fact that the interleaving 
construction restricts the locations of subsets of elements in a severe manner. Alternative interleaving 
methods will be discussed in a companion paper. 



In what follows, we describe a method of Beame and Blais [2J that provides translocation codes with 
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Figure 4: Asymptotic rate comparisons, 
minimum distance proportional to n - o(n). This covers the zero-capacity domain of our analysis. 

5.5 Zero-rate codes 

We present two constructions based on the longest common subsequence analysis. The first construction is 
based on Hadamard matrices, while the second is based on a number theoretic method the details of which 
may be found in [5]. 

Assume that a Hadamard matrix of order k exists. To explain this construction we consider permutations 
over the set {0, 1, n - 1}. Furthermore, the positions in each permutation are also numbered from to 
n - 1. 

Let s = \n^/^''^^^'\. For a e {0, 1, n - 1}, we denote the representation of a in base s by aia2---ak-i where 
ai is the most significant digit. Let H he a Hadamard matrix of order k with rows and columns indexed 
by elements in the set {0,l,---,k - 1}. Without loss of generality, assume the first row and column of H 
are all-ones vectors. The set {Tr^lj^j^ of permutations is constructed by defining the mth element of tt^, for 
m = 0, 1, s*^"^ - 1, as follows. Let m = my-mk-i, and let the mth element of tt^ equal 

■Ki (to) = aia2---ak-i 

where, for j e {0, 1, . . . , fc - 1}, 

^ _ {rrij, if Hij = 1, 

I S - 1 - TTlj , if Hij = -1. 

The length of the longest common subsequence of any two permutations of {iTi} is at most s*^^^"^. The 
permutations obtained in this way have length s*^"^. Consequently, the minimum distance of the code is at 
least s'=-i - s'=/2-i. Note that if s''-^ > n, we can arbitrary delete elements from each permutation to obtain 
a set of permutations each of length n. 
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Figure 5: Permutation codewords based on Hadamard matrices. 



As an example, consider n = 27 and fc = 4. We have 
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and s = 3. Four codewords of the code based on this Hadamard matrix are plotted in Figure [5751 

Another construction based on [2J holds for 3 < fc < \/n, leading to k permutations with minimum 

1/3 

translocation distance at least n - 32 (fcn) . The construction is described below. 

Let s = (n/fc^) and s' = (nk)^^^ and assume s and s' are integers. Consider the integer lattice 
[s] X [s' - 1] X [s' - 1] denoted by X. Let ipn.k '■ [n] X he such that {x, y, z) = (j)k,n {'m) if and only if 

(m- 1) = (x - I) + s (y - I) + ss' (z - 1) . 

Let the smallest prime larger than 4s' be denoted by p. For i e [3] and j e [k] define gij : X ^ Z a.s 

93,] {x,y,z) = f + 2jy + 2z (mod p) , 
g2,j{x,y,z) =jx + y, 
51 J {x,y,z) = X, 

and let hj = {gij o0n,fc,g2.j °0n,fc,33.j °4'n,k)- Each hj defines a total ordering <j on [n\. Let TTj be the 
permutations that is sorted with respect to <j. That is, 

(1) <] TTj (2) <j ■■■<j Tlj{n). 

1/3 

The set {tti, 7r2, Tr^} of permutations has minimum translocation distance at least n- 16(nfc) 

If s and s' are not integers, a set of k permutations over [n] with minimum translocation distance at 
least 32(rifc)^^^ may be constructed using a similar approach. 
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6 Conclusion 



We introduced the notion of translocation error in rank modulation systems. Translocation errors may be 
viewed as generalization of adjacent swap errors frequently encountered in flash memories. We demonstrated 
that the translocation distance between two permutations is a linear function of the longest common subse- 
quence of the permutations and derived asymptotically tight upper and lower bounds on the code capacity. 
Furthermore, we presented a number of constructions for translocation error-correcting codes based on inter- 
leaving permutation codes in the Hamming metric and deletion-correcting codes in the Levenshtein metric. 
Finally, we exhibited a low-complexity decoding method for a class of relaxed interleaved codes of non-zero 
asymptotic rate. 
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